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Abstract 

We propose a long term portfolio management method which takes into 
account a liability. Our approach is based on the LQG (Linear, Quadratic 
cost, Gaussian) control problem framework and then the optimal portfolio 
strategy hedges the liability by directly tracking a benchmark process 
which represents the liability. Two numerical results using empirical data 
published by Japanese organizations are served: simulations tracking an 
artificial liability and an estimated liability of Japanese organization. The 
latter one demonstrates that our optimal portfolio strategy can hedge his 
or her liability. 



1 Introduction 

In the management of pension fund, a long term portfolio strategy taking into 
account a liability is one of the most significant issue. The main reason is the 
demographic changes in the developed countries: if the working-age population 
is enough to provide for old age, the liability is a minor issue in the portfolio 
management. Since the life expectancy have increased in recent decades, it 
becomes insufficient to provide for old age. Furthermore the low birth rate 
continues and drives up this problem for decades. Thus pension funds face a 
challenging phase to construct long term portfolio strategies which hedge their 
liabilities. 
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A lot of pension funds except a few ones pQ determine their portfolio strate- 
gies by the traditional single time period mean variance approach which ex- 
cludes an evaluation of a liability. Its intuitive criterion attracts managers of 
pension funds. However the single time period approach is unsuitable for a long 
term portfolio management in the sense that it is unable to change the strat- 
egy excepting the initial time. The multi time period approach which arrows 
the change of the strategy has a problem that the computational complexity 
growths exponentially. Hence if we employ this approach, we are usually unable 
to obtain the optimal portfolio strategy in realistic time. 

Therefore the aim of this paper is to propose a long term portfolio strategy 
which (i) involves an evaluation of a liability, (ii) admits changes of the strategy 
at any time, and (iii) is obtained in realistic time. To tackle this problem, we 
employ the LQG (Linear, Quadratic cost, Gaussian) control problem (see, e.g., 
Fleming and Rishel 2 ). The LQG control problem is a class of stochastic control 
problem and is able to provide the control minimizing the mean square error of a 
benchmark process and a controlled process. Roughly speaking our tactic is that 
we compute the optimal portfolio strategy with the benchmark process which 
represents the liability. Then we can track the liability by using our optimal 
portfolio strategy. Although it is difficult to obtain the solution of stochastic 
control problem in general, the LQG control problem has the analytical solution 
which assures that we are able to obtain the solution in realistic time and thus 
it meets our purpose. 

A continuous time stochastic control approach is one of the most popular 
method to obtain the suitable long term portfolio strategy. The literature about 
this approach is quite rich. The papers treating managements of pension funds 
are, for instance, as follows: Deelstra et al. [3] and Giacinto et al. [4] discuss the 
portfolio management for pension funds with a minimum guarantee; Menoncin 
and Scaillet [5] and Gerrard et al. [6] deal with the pension scheme including 
the de-cumulation phase. Our study is on the cutting edge in the sense that deal 
with tracking liabilities directly and constructing a suitable long term portfolio 
at the same time. 

The organization of the present paper is as follows. We introduce continuous 
time models of assets and a benchmark in Section [2] To fit in the LQG control 
problem, they are defined by the linear stochastic differential equations (SDEs). 
We mention that our portfolio strategy is represented by amounts of assets. 
In Section [3] we define a criterion of the investment performance and provide 
the optimal portfolio strategy explicitly. Several numerical results are served in 
Section SJ Throughout the section the parameters related to the assets are de- 
termined by an empirical data provided by the Government Pension Investment 
Fund, Japan. The simulation using an artificial data are discussed in Section [JT] 
and this result gives conditions that our optimal portfolio strategy works well. 
Section B~2l provides the case study using an empirical estimations published by 
the Japanese Ministry of Health, Labour and Welfare. It demonstrates that our 
strategy is able to hedge the liability well. 
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2 Continuous time models of assets and a bench- 
mark 



In this section, we present mathematical models of assets and benchmark. The 
market which we are considering consists of only one risk-free asset and n-risky 
assets and we have m-benchmark component processes. 

Let (f2, J 7 , {J r t}t>o, P) be a filtered probability space {VFi} t >o be a d-dimensional 
Brownian motion where d = n + m and £ 2 (/iT x P) be a space of stochastic 
processes {Zt} t>0 which satisfy 



E 



Jo 



\Z t \ 2 dt 



< CO. 



We denote price processes of the risk-free asset, the risky assets and the bench- 



mark component processes by S^,S t = (Sj, ■ 



andF t - {¥?,■•■ ,Y t r 



respectively, here the asterisk means transposition. To fit in the LQG control 
problem, we assume that S^St and Y t are governed by the following SDEs: 



= 




5 ° = s e 



d -^ = b\t)dt + 

Sh = 4 G I 



1,2,- 



\dY t = {a{t)Y t + h(t)) dt + a Y (t)dW t , 
I Y = y G R"\ 



(1) 



(2) 



(3) 



where r : [0,T] -> R, b : [0,T] -> R", a s : [0,T] -> R" xd , a : [0,T] R™*™, 
h : [0,T] -> R m and cry : [0,T] ->■ R mxd are deterministic continuous functions 
and T < oo represents the maturity. Coefficients r, 6* and as stand for the 
risk-free rate and the expected return rate of the i-ih asset and the volatility. 
Let a class of portfolio strategy A be the collection of R"-valued /(-adapted 



process {u t } 



0<t<T 



which satisfies 



E 



Jo 



\u t \ 2 dt 



< oo, 



£ t e R" be the amount of the risky asset held by an investor at time t, and 
X t be the value of our portfolio at time t. Then the amount of the risk-free 
asset held by the investor is represented by X t — YH=i £*■ Hence {X t } < t < T is 
governed by 



-l 



SI 



x t 



-gQ, \&}0<t<T G A, 



(4) 



k X = x 



4h 
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where 1 = (1, • • • ,1)* € K n . To emphasize the initial wealth and the control 
variable, we may write X t = X^ '^ . 

The solution X t of the SDE Q is given by the following: 

f* 

X t = eti r{u)du x + / e^ r{u)du {b{s) -r{s)l)*i s ds 
Jo 

Jo 

Moreover since r, b, and 175 are continuous functions on [0, T\ and £ G A, X t is 



in £ 2 (p T x P): 










E 




< E 




Jo 


+ 



E 



{.Jo e ^ r(U)dU ^ -ri^YLds^J dt 



T / ft \ 2 

eI> r W du ga s (s)dW s ) dt 
\Jo J 



< TK xl + T 2 K^ 



£ll*6da 



-TK 2 E I £ll*£ 8 ds 
Jo 

< 00, 

where Kq, K\ and K2 are constants. 

3 Optimal investment strategy 

We define the criterion of investment performance J by 



At] = E 



71 (a{t)*Y t - X?°<^ dt + 72 ( 



A*Y T - X 



, C 6 A, (5) 



where 71 > and 72 > are constants, A 6 R m is a constant vector, and 
a : [0, T] — > M m is a deterministic continuous function. Hence our investment 
problem is to find the control £ s.t. J[£] < </[£], £ G A Since the performance 
criterion is represented by quadratic functions, our investment problem become 
the LQG control problem. We determine a(t), At and the parameters of Y t to 
be able to regard a(t)*Y t and A T Yr as a liability. 

The optimal portfolio strategy is represented in the following form: 
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Theorem 3.1 We define the portfolio strategy £ as follows: 

i 



2F 00 (t) 



(as^asit)*)- 1 (6(t) - r(t)l)(2F 00 (t)X t + 2F°*(t)Y t + G°(t)) 



-2a s (t)aUt)F°(t) 



(6) 



where F 0Q ,G° : [0,T] -> R and ^ : [0,T] 
ordinary differential equations (ODEs): 

d 



are solutions of following 



dt 



F w {t)+ ll + 2r{t)F w {t) 



-(b(t) - r(t)iy(as(t)as(t)*)-Hb(t) r(t)l)F 00 (t) = 0, 



(7) 



[F™(T) =72, 



(8) 



(9) 



|#°(t) - 7l a(<) + r(t)#°(t) + a(i)*F°(f) 

-(6(f) - rWlJ^asW^Ci)*)- 1 ^^) - r(t)l)F°(t) = 0, 
F°(T) = -2 l2 A, 

4<3°(*) + r(t)G Q (t) + 2h{t)*F°(t) 

-(6(t) - r^'Mt^*)*)-^*) - r(t)l)G°(t) 

-(6(f) - r^lJ'^sW^W*)- 1 ^^)^ (t)F°(t) = 0, 
G°(T) = 0. 

Then £ satisfies £ £ .4 and J[£] < J[£], £ G .4. 

The proof of Theorem 13.11 is given in the appendix. 

We note that £t has feedback terms of X t and Y t . This implies that our 
optimal strategy has delays to catch up the the benchmark process a(t)*Y t . 
Hence the preferable situation applying our strategy is the case that a(t)*Y t 
does not fluctuate violently. 



4 Numerical results 

We apply our method to an empirical data provided by the Japanese organi- 
zations. This section is divided to two subsections according to the type of 
liabilities, an artificial liability and the liability constructed by the estimations 
published by the Ministry of Health, Labour and Welfare of Japan. The former 
one suggests the situation that our optimal strategy works well and the latter 
one demonstrates that our portfolio strategy is able to hedge the liability. 

Before we move on the each subsection, we determine the common param- 
eters in following subsections. The first task is to determine the parameters 
relating to the benchmark component processes. They consists of the income of 
a pension fund Ct and his or her expense B t and thus n — 2 and Y t = (Ct,B t )* . 
We set the parameters constructing the benchmark process as follows: 

7i=72 = l, a(i) = (-l,l)*, A =(-1,1)*. 
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Hence the benchmark process is B t — Ct which represents a shortfall of the 
income and then we regard this shortfall as the liability. To discuss the perfor- 
mance of the strategy, we introduce a hedging error function of the i-th sample 
path El and its average E t as follows: 

N pi 

Ei=\(B t -C t )-Xt\, Et = Y,^> 

1=1 

where X\ is the i-th sample path of X t and N £ N is the number of the sample 
path. We set TV = 1000 except as otherwise noted. 

The next task is to determine the risk-free rate and the expected return 
rates and volatilities of risky assets. We invest the following four assets: indices 
of the domestic bond, the domestic stock, the foreign bond and the foreign 
stock; we number them sequentially. According to the estimations of return 
rate and volatilities by the Government Pension Investment Fund, Japan [7], 
we construct b(t) and a s (t) as follows: b x (i) = 3%, b 2 (t) = 4.8%, b 3 (t) = 3.5% 
and b A {t) = 5.0%; 

a«(t)=\fi iJ£{h-,n}, (10) 
I otherwise, 

where £ the Cholesky decomposition of S, a variance-covariance matrix of the 
assets: 

/ 0.00297025 0.0018189375 -0.000439488 -0.0005409125\ 
0.0018189375 0.04950625 -0.00777504 0.0119248875 
-0.000439488 -0.00777504 0.01806336 0.01467312 
\-0.0005409125 0.0119248875 0.01467312 0.03940225 / 

We choose a money market account as the risk-free asset and we set r(t) = 0.0%. 
4.1 Simulation with an artificial liability 

In this subsection we consider the following an artificial deterministic liability 
model: 

(dC t = omc t dt, 

\ C = 80 [trillion yen], 1 ' 

{dB t = 0MB t dt, 

| B = 100 [trillion yen], 

i.e., we set a v (t) = 0.01^- and h(t) — 0. We assume that our wealth coincides 
with the benchmark at the initial time: Xo = Bq — Co- We construct the 
optimal portfolio strategy over three decades, i.e., T :— 30 and we determine 
the functions F 00 , F° and G° by solving the ODEs ©-© numerically. Then 
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we simulate N paths of (S t7 Y t ) on [0,T] according to equation (P)-© using a 
standard Euler-Maruyama scheme with time-step At = 0.25. Figure [T] describes 
an investment result of a sample path. The black and red lines in FigureQ] 
represent B t — Ct and X t respectively. 




Figure 1: A sample path of B t — Ct 
and X t . The black and red lines rep- 
resent B t — Ct and X t respectively. 



Figure 2: Averaged hedging error E t 



The most significant issue it indicates is that the performance of the strategy 
is quite poor near the maturity. Figure [2] describing Et implies that this poor 
performance does not depend on the sample path. Figure |3] suggests a key factor 
of this phenomenon: values of functions F 00 , F° and G° change drastically 
between t = 25 and t = 30; this time period coincides with the term the 
hedging error becomes large rapidly. Figure [3] also implies that the existence 
of the stationary solutions of the ODEs ([T])-©. As described in Figur^l the 
strategy relatively works well on the time period when the functions F 00 , F° 
and G° reach the stationary state. Hence the strategy will be improved by using 
the stationary solutions of the ODEs ©-(0 on entire region. 
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Figure 3: The time evolution of F 00 , F° and G°. The black, red, green and 
blue lines represent F 00 , F 01 ,F 02 and G° respectively. 



To obtain the stationary solutions of the ODEs (JT])-©, we replace T to a 
value large enough. We denote it by T and set T — 50. Figure^] shows values 
of F 00 , F° and G° obtained by solving the ODEs ©-© with parameter f. 
We can find that the functions F 00 , F° and G° take the stationary solutions on 

[o,n 
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Figure 4: The time evolution of F 00 , F° and G° (improved case). The black, 
red, green and blue lines represent F 00 , p 01 ,F 02 and G° respectively. 



Results of simulations using the improved strategy are described as follows. 




Figure 5: A sample path of Bt — 
Ct and X t (improved version). The 
black and red lines represent B t — Ct 
and Xt respectively. 



Figure 6: Averaged hedging error Et 
(improved version). 



Figure [S] and Figure [B] indicate that the performance near the maturity is 
improved and it does not depend on the sample paths. This result leads us to the 
conclusion that we should construct the strategy with the stationary solutions 
of the functions F 00 , F° and G° if they exists. 

At the end of this subsection, we mention about our portfolio composition. 
Figure [7] displays the asset allocation on the sample path described in Figure 
Money market account, domestic bond and foreign stock indicated by light 
blue black and blue lines respectively dominate our portfolio. The optimal 
strategy is that we keep the most part of the wealth as money market account 
and compensate for the increment of the benchmark by the investment for the 
domestic bond, low risk and low return asset, and the foreign stock, high risk 
and high return asset. If Xt is deficient in Bt — Ct, the strategy increases the 
proportion of the domestic bond and the foreign stock. 
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Figure 7: An amount of each asset on the sample path described in Figure 
The black, red, green, blue and light blue lines represent the amount of a 
domestic bond, a domestic stock, a foreign bond, a foreign stock and money 
market account respectively. 



4.2 Simulation with an empirical liability 

According to the Japanese actuarial valuation published in 2009 [5], estimated 
income and expense summed up the national pension and the welfare pension 
are showed in the figure [8j 



Figure 8: Estimations of income and expense of Japanese national and welfare 
pensions. The black and red lines represent estimations of their income and the 
expense respectively. 

We regard these estimations as C t and B t and simulate the three decades 
investments using our optimal strategy from 2040 when the shortfall of the 
pension fund starts to expand drastically. The following reasons support that 




2020 2040 2060 2080 2100 
[year] 
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this situation is a valid case study: (i) a phase expanding B t — C t , the shortfall of 
the pension fund, is the most typical one expressing the demographic changes; 
(ii) the behaviour of B t — Ct in this term meets the condition to apply our 
optimal strategy: B t — Ct is increasing in the entire region. Throughout this 
subsection we set the start point as the year 2040, i.e., t = and t = 15 represent 
the year 2040 and the year 2055 respectively. 

To construct the optimal strategy, we first calibrate a(t), ay{t) and h(t) to fit 
the estimations. Setting a(t) — <7y(t) — and h(t) as a numerical differentiation 
of the estimations is a simple method to accomplish the purpose. Since we are 
discussing the three decades portfolio, we determine T — 30. As suggested in 
Section l4TTl we set T = 50 to obtain the stationary F 00 and F°. We are unable to 
expect the stationary G° because h(t) explicitly depends on t. We assume that 
our wealth coincide with the benchmark at the initial time: Xo = Bq — Cq. Then 
we simulate N paths of (S tl Y t ) on [0,T] according to equation (P)-© using a 
standard Euler-Maruyama scheme with time-step At = 0.25 which means that 
we can rearrange our portfolio every quarter. Results of the simulation are as 
follows. 




Figure 9: A sample path of B t — Ct 
and X t . The black and red lines rep- 
resent B t — Ct and X t respectively. 



Figure 10: Averaged hedging error 
E t . 



We are able to argue that our strategy hedges the shortfall well since Figure 
[TU] suggest that E t , the averaged hedging error, is approximately 3% of B t — C t , 
the shortfall, in every quarter. 
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Figure 11: An amount of each asset on the sample path described in Figure 
The black, red, green, blue and light blue lines represent the amount of a 
domestic bond, a domestic stock, a foreign bond, a foreign stock and money 
market account respectively. 

Figure [TT] displays the asset allocation on the sample path described in Fig- 
ure [9] In the same manner as in the case of the artificial liabilities discussed 
in Section 14. 1\ our optimal portfolio is dominated by the money market ac- 
count, the domestic bond and the foreign stock. However the proportion of 
the domestic bond and the foreign stock is much higher. We can understand 
this phenomenon intuitively: since the shortfall increases more rapid than that 
discussed in Section I4.1[ the hedging portfolio is rearranged to become more 
profitable. The practical suggestion from this fact is that we have to take a risk 
to track the increasing liability and this is quite natural. 

5 Summary 

We have proposed a long term portfolio management method which takes into 
account a liability. The LQG control approach allows us to construct a more 
suitable long term portfolio strategy than myopic one obtained by the single 
time period mean variance approach in the sense that we are able to change 
the strategy at any time. Our optimal portfolio strategy hedges the liability 
by directly tracking the benchmark process which represents the liability. The 
strategy is evaluated by the mean square error from the benchmark and hence 
it is intuitive. Two numerical simulations are served: the former one suggests 
the situation that our portfolio strategy works well; the latter one provide the 
result with the empirical data published by Japanese organizations. The result 
demonstrates that our portfolio strategy is able to hedge the liability, the short- 
fall of the income summed up the national pension and the welfare pension, over 
three decades. 

This study leaves ample scope for further research. Since our criterion is the 
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mean square error, our portfolio strategy inhibits that our wealth exceeds a lia- 
bility. This is the similar problem with the traditional mean variance approach. 
One of approaches to avoid it is that we extend criterion which is able to hedge 
only the case that our wealth goes under the liability. Then we again face the 
problem of computability as mentioned in the introductory section. 
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A Proof of Theorem 13.11 



V t (x,y) = infE 



The value function corresponding to our problem (JSJ is defined by 

' T 7 i (a( S )*Y s - X^) 2 ds 

+72 (A*Y T - X^) 2 •« =x,Y t =y 

Hence the corresponding Hamilton- Jacobi-Bellman (HJB) equation is given by 
inf ' {d t V t (x,y)+^V t (x,y)+ 11 (a(t)*y-x) 2 } = 0, (13) 

with terminal condition Vr(x,y) = 72(^*1/ — x) 2 , (x,y) elx R m , where dt 
is partial differential operator with respect to t and Jz?^ is the infinitesimal 
generator of the process (X t ,Y t ): 

&4>{x, y) = (r(t)x + (b(t) - r(t)l)*&) d x cj>(x, y) + (a(t)y + h(t))* d y <f>{x, y) 
+1 [^as(t)as(tr^dlcl>(x,y)+2^as(t)a Y (t)*d y d x <p(x,y) 
Tr (<T Y (t)* Y (t)*d2ct>(x,y))] 

for <j) & C 2 (tx R m ). Here d x and d 3 y are the j-th order partial differential 
operators with respect to x and y. As <Js(t)as(t)* is positive definite, the 
infimum of (|13|) is attained at 



-1 



(a s (t)a s (t)*) [(b(t)-r(t)l)d x V t (x,y) 

+as(t)a Y {tyd y d x V t (x,y)] 
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and hence (fT5|) can be written as 

d t Vt(x,y)+r(t)xd x Vt(x,y) + {a(t)y + h(t))* d v V t {x,y) 

1 (b(t)-r(t)l)%as(t)a s (ty)-\b(t)-r(t)l)(d x V t (x iy )y 



2d%V t (x,y) 



^^ (b(t)-r(t)l)%a s (t)a s (tn-^s(t)a Y (trd y d x V t (x,y) 
(dyd x V t (x,y))* cr Y (t)a s (t)*(T S (t)aY(tydyd x Vt(x,y) 



+c (x 2 - 2xa(t)*y + y*a(t)a(t)*y) 

= 0. 

Let us try a value function of the form 

V t (x, y) = F 00 (t)x 2 + 2xF°(t)*y + y*F(t)y + G°(t)x + G(t)*y + g(t), 

where F 00 ,G°,g : [0,T] -> R, F°,G : [0,T] -)• M m and F is a time-dependant 
symmetric matrix. It is straightforward to see that 

-F 00 {t)+c + 2r{t)F 00 {t) 

-(b(t) - r(t)iy(a s (t)<T S (ty)-i(b(t) - r(t)l)F 00 (t) = 0, ( 14 ) 
F 00 (T) = C, 

jF°{t) - ca(t) + r(t)F°(t) + a(t)*F°(t) 

-(b(t)-r(t)l)*(a s (t)a s (ty)-\b(t)-r(t)l)F°(t)=0, ( 15 ) 
F°(T) = -2CA, 

^G°{t) + r(t)G°(t) + 2h(t)*F°(t) 
at 

-(b(t) - rmyiasi^asityrHHt) - r(t)l)G°(t) (16) 
-(6(t) - r^iyias^asityy^s^aY^F^t) = 0, 
[G°(T) = 0, 

|F(t) + ca(t)a(ty + 2a(t)*F{t) -±^F°(t)F°(ty = 0, 
k F(T) = CAA*, 

^G(t) + r{t)G(t) + a(t)*G(t) + 2h(t)*F(t) 
at 

-^J^m-ritnyias^asityr^m-rimP^t) 

- Kt)l)> s (i)<r s (i)*rVsM^(^° (i))#°(t) = 0, 

lG(T) = 0, 
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r d 



-g(t) + h(t)G(t) + Tv(a Y a* Y F(t)) 

2 

-^4fy( & w - rmnas^asitrr'm r (<)i) 

G° 
~2F 00 (i) 
= 0, 



(19) 



(b(t) - r(t)l)*(as(t)as(t)*)- X <Ts(t)a Y (t)F (t) = 0, 



and then the associated candidate strategy is represented by 
-1 



6 



2F 00 (t) 



(as&asityy 1 (6(t) - r(t)l)(2F 00 (t)X t + 2^°*(t)Y t + G°(f)) 



-2a s (*)<4(i)J*(t) 



(20) 



Since (fT3| and (fT5j) are linear ODEs, they have unique solutions on [0,T]. The 
existence of unique F°(t) suggests that (flo]) and (fT?)) are linear ODEs. Hence 
(fTB]) and (jTTf also have unique solutions on [0, T]. In the same manner, the 
existence of unique solutions of (fT5|) and ([T9"]) are guaranteed. Therefore ^ 6i 
since X t and Y t in C 2 (pt x P)- 

We now start the verification, i.e., we show that V~o(xo,yo) < J[d], £ G -4 
and Vo(xo, j/o) = To this end, we introduce a sequence of stopping times 

{n} leN s.t. 



n=m£iu>0: / \X t £ t \ dt>l, \Y t £ t \ dt>l 



Applying the Ito formula to Vtati (^tat, , Y^at, ) and taking expectation, we 
have 



V6(x ,yo) = E 



TAr, 



(-9eVt(X t ,y t ) -J2? ? Y t (X t ,Y t )) dt + V T (X TATl ,Y TATl ) 

TAT, 

^Vt(X t ,Y t )e*o- s (t)dWi 



-E 



TAT, 



(d x Vt(X t ,Y t )) a Y (t)dW t 
As X( and Y t are in C 2 (^t X P) and cry, F°, i* 1 and G are continuous functions 
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on [0,T], the last term vanishes: 



r TAn 



(d y V t (X t ,Y t ))* a Y (t)dW t 



= 21 



TAr, 



X t F°(t)*a Y (t)dW t 

TAr, 

Y*F(t)a Y (t)dW t 



-E 



TAr, 



G(t)*a Y (t)dW t 



0. 



By the definition of 77, the continuity of the functions F 00 , F° and G°, and the 
fact that £ £ A, the remaining stochastic integral term also vanishes: 



E 



TAr, 



d x Vt{X t ,Y t )ga s (t)dW t 



TAr, 



+2E 



F w {t)X t C t a s {t)dW t 

TAr, 

F (t)*Y t £as(t)dW t 



TAr, 



G°(t)tf*s(t)dW t 



= 0. 

Since 77 00 when I goes to infinity, we get 

i-T 



Vo(x ,y Q ) = E 







-dt V t (X t ,Y t )-S*Vt (X t ,Y t ))dt + V T (X T , Y T ) 



By the HJB equation (IT5|) and its terminal condition, we obtain 



V (x ,yo) < 



7 i(a(i)*y - xYdt + l2 {A*y - xf 



which means that Vo(xo,?/o) 5: £ € A. In the same manner we find that 
Vo(%o , Do) — J[^} and then the claim is established. 
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